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Abstract. Given a continuous function from Euclidean space to the real line, we an- 
alyze (under some natural assumption on the function), the set of values it takes on 
translates of lattices. Our results are of the flavor: For almost any translate, the set of 
values is dense in the set of possible values. The results are then applied to a variety of 
concrete examples, obtaining new information in classical discussions in different areas in 
mathematics; in particular, Minkowski's conjecture regarding products of inhomogeneous 
forms and inhomogeneous Diophantinc approximations. 



1. Introduction 

Given a continuous function F : IR d — > M, it is a natural question in number theory, to 
try and analyze the set of values F takes on points of a lattice in M. d . Up to a linear change 
of variable, this question is of course equivalent to analyzing the values F takes at integer 
points. We shall be interested in an inhomogeneous variant of this discussion; we try to 
analyze the set of values F takes on grids, that is on translated lattices. We approach 
this discussion from a dynamical point of view which leads us to impose some natural 
assumptions on the function F under consideration. We present a variety of concrete 
examples in § [2] and applications to classical discussions in Diophantine approximations 
and the geometry of numbers in § [31 The main tools we develop to derive our results, are 
the Mixing and the Coset lemmas, appearing in § HI The discussion in § H] is concerned 
with closures of certain random sequences on the (i-torus. It is independent of the rest of 
this paper and may be of independent interest on its own. 

1.1. Basic notions. The basic objects we shall work with are lattices and grids. As we 
wish to exploit dynamical methods, it is most convenient to present spaces, the points 
of which are lattices and grids. We set G = SL<f(R),r = SLd(Z), and let Xd = G/T. A 
point gT G Xd will be denoted by g. The space Xd can be identified with the space of 
unimodular (i.e. of covolume 1) lattices in M. d . The identification is defined by associating 
to g G X d the lattice spanned by the columns of g. We let Y d — (G tx M. d ) / (Fix if) . The 
space Yd is identified with the space of unimodular grids in M. d , where by a grid we mean a 
set of the form g + v = {w + v : w G g} . Points of Yd are denoted by g + v where g G G and 
v G K d . We endow Xd, Yd with the quotient topology, induced from the usual topology on 
G and G x M. d respectively. In fact, Xd, Yd are smooth manifolds and inherit a complete 
Riemannian metric from the corresponding group. It will be convenient to interchangeably 
think of the points of Xd, Yd, as subsets of M d , or as points in the corresponding manifold. 



* The author acknowledges the partial support of ISF grant number 1157/08. 

1 



2 



URI SHAPIRA 



We shall denote points of X d , Y d by lower case letters, x, xi, y, y' etc. It is a good exercise 
for the reader who is not familiar with the topologies introduced above, to work out the 
meaning of two lattices (resp. grids) being close to each other, when thinking of them as 
subsets in R d . This boils down to saying that in a very large box, centered at the origin, 
the corresponding two sets are close in the usual sense. 

There is an obvious projection ir : Y d — > Xd given by n(g + v) = g. The fiber 7r _1 (x), for 
x G Xd, is naturally identified with the torus R d /x. Note also that Xd naturally embeds 
in Yd- For a continuous function F : R d — > R, we define for each y G Yd the value set 

V F (y) = {F(v):vey}. (1.1) 

In general, it is of interest to analyze for a fixed grid y G Y d , the value set Vp{y). In 
particular, one would like to answer questions such as: Is the value set dense, discrete, or 
does it contain zero in its closure. We make the following definition which will be of most 
interest to us in this paper. 

Definition 1.1. Consider a fixed continuous function F : R d — > R. 

(1) A grid y G Y d , is DV (dense values), if the value set, Vp(y), is dense in the image 
F(R d ). 

(2) A lattice x G Xd is grid-DV, if all its grids are DV. 

(3) Given a lattice x G X d , and a probability measure /i, on the torus tt~ 1 (x), we say 
that x is /i-almost surely grid-DV, if //-almost any grid y G 7r _1 (x), is DV (we 
sometime express this by saying that x is almost surely grid-DV with respect to 
Ai). 

In general, it is a hard problem to decide for a specific function F, whether or not there 
exist grid-DV lattices, although in some concrete examples the answer is known (see §|2J). 
On the other hand, the existence of almost surely grid-DV lattices (with respect to some 
natural family of measures) is guaranteed, once some reasonable assumptions on F are 
imposed. In this paper we find sufficient conditions for a lattice x G X d to be almost 
surely grid-DV with respect to some measures which we now turn to describe. 

1.2. Haar measures of subtori. Let x G X d be given, we say that a subspace U < R d , 
is rational with respect to x, if x D U is a lattice in U. The rational subspaces are in 
one to one correspondence with the closed connected subgroups of the torus 7r _1 (a;). We 
refer to these as subtori and denote the subtorus corresponding to a rational subspace, 
U, by U + x. The Haar probability measure on such a subtorus is denoted by Xjj. For 
v G R d and a subtorus U + x, we denote by v + (U + x) the coset of v with respect to 
the subtorus. The translation of the Haar measure Xu by v, supported on the coset, is 
denoted by \ v+ u- When U = R d , we sometime refer to Xu as Lebesgue measure. 

1.3. The invariance group. Let 

H F = {geG:Fog = F}, 
H F = H° F . 
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Here H^, denotes the connected component of the identity in Hp. The group Hp will 
be referred to as the invariance group of F . It is a closed subgroup of G and in most 
interesting cases it is (the connected component of the identity of ) an algebraic group. 
This happens when F is a polynomial, but also in some other cases too. We say that F 
is noncompact if Hp is noncompact . As our approach is dynamical, we shall only discuss 
the noncompact case. 

There are natural actions (by left translation) of G on Xd, Yd. These actions commute 
with 7T. When thinking of the points of Xd, Yd as subsets of M , these actions are induced 
by the linear action of G on ~R d . The importance of the invariance group to our discussion 
is that it leaves the value sets invariant; for any y G Yd and any h G Hp, 



The following lemma illustrates how dynamics comes into the game. The reader should 
deduce it from equation (11. 2p . the continuity of F, and the topology of Y d : 

Lemma 1.2 (Inheritance). Let y,y G Yd be such that yo G H F y. Then Vp(y ) C Vp(y). 
In particular, if y is DV, then so is y. If y has a discrete value set, then so does yo. 

As SLd(Z) is a lattice in SL<j(R), the space Xd carries a G-invariant probability measure. 
Similarly, Yd carries a G x M d -invariant probability measure. We shall refer to both these 
measures as the Haar measures. It is not hard to show that the Haar measure on Yd 
disintegrates to the Lebesgue measures on the fibers {7r _1 (x) : x G X d }, with respect to 
the Haar measure on Xd- The Howe-Moore theorem asserts in this case, that the en- 
actions on Xd, Yd are mixing with respect to these measures. As we assume that Hp is 
noncompact, it acts mixingly and in particular, ergodically on both spaces. It follows in 
particular, that for almost any grid y G Yd, Hpy is dense in Yd. This gives us the following 
immediate corollary of the inheritance lemma 

Corollary 1.3. Almost anyy G Yd is DV. Moreover, almost any lattice, x G Xd, is almost 
surely grid-DV with respect to the Lebesgue measure of the full torus 7r~ 1 (x). 

In this paper we wish to point out a fairly general connection between the dynamical 
behavior of a lattice x G Xd under the action of the invariance group, and the value set, 
Vp(y), along a fiber 7r _1 (x). Our aim is to sharpen Corollary 11.31 and to develop better 
understanding of the set of almost surely grid-DV lattices. 

Before ending this introduction, let us make the following definitions which we need in 
order to state our results. We say that a sequence in a topological space is divergent, if it 
has no converging subsequences. An orbit H F x, is said to be divergent if for any divergent 
sequence h n G Hp, the sequence h n x is divergent in Xd (i.e. if the orbit map is proper). 

Definition 1.4. A continuous function F : M d — > IR is nondegenerate if for any nontrivial 

subspace U <R d and any grid y G Y^, one has 



A typical example of a degenerate function, which we wish to avoid, is a polynomial 
with coefficients in Z, which does not depend on one of the variables (the simplest example 



V F (y) = Vp(hy). 



(1.2) 



{F{u + v) : u G U, v G y} = F(R d ). 



(1.3) 
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is F : M 2 -> R, given by F(vi,V2) = V\). In this case, choosing the grid to be simply 7L d 
and U to be the line corresponding to the variable that does not appear in F, we see that 
the set in fll.3j) . is discrete. 

Definition 1.5. A sequence h n G G is said to be almost finite with respect to a subspace 
[/ < M d , if there exist a a sequence e n G C7 with e n — >■ e, such that the set of restrictions 
{e n h n \u} is finite. 

Note that any diverging sequence, h n , is not almost finite with respect to R d . Also, if 
h n are diagonalizable, and the eigenvalues approach either or oo, then h n is not almost 
finite with respect to any nontrivial subspace. 

2. Examples and results 

2.1. Main theorem. The following theorem is a simplified version of our main result. 
This version is sufficiently strong for most of our applications. 

Theorem 2.1. Let F : M d — > IR be nondegenerate and noncompact. Let x G X^ be a 
lattice with a nondivergent Hp-orbit and let A be the Lebesgue measure of the full torus 
n~ l {x). Then x is X-almost surely grid-DV. 

Theorem 12.11 follows from the following theorem which is the main result in this paper. 
It is proved in § [5j 

Theorem 2.2. Let F : IR d — > IR be nondegenerate and noncompact. Let x G be a 
lattice with a nondivergent Hp- orbit and let X w +u be the translation of the Haar measure 
supported on the subtorus U + x by a vector w G M d . Then, if there exists a divergent 
sequence h n G Hp, such that h n x converges and h n is not almost finite with respect to U, 
then x is \ w +u -almost surely grid-DV. 

In some examples (as will be seen below), the fact that the sequence h n is not almost 
finite follows automatically from properties of the invariance group. We now turn to apply 
these results to a variety of concrete examples. 

2.2. Rank one. The following family of examples is particularly relevant to Diophantine 
approximations (see §3.21) . Let m,n be positive integers such that m + n = d. Let 
us denote vectors in M. d as column vectors (x, y)* (here t stands for transpose), where 
x G W n ,y G R m . Let P n , m : l d -> 1 be defined by P n>m (x,y) = ||x||2 ||y||™ (where 
we denote here by ||-|| 2 , the Euclidean norm on the corresponding space). Denoting the 
invariance group by H ntTn , it is not hard to see that H n m = SO(n) x SO(m) x {a nim (t)} tgR , 



It is not hard to see that P n ^ m is nondegenerate and noncompact, hence Theorem 12.21 
applies. In fact, we have the following theorem. 



where 




(2.1) 
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Theorem 2.3. Let x £ Xj be a lattice with a nondivergent H nm -orbit. Then for any 
subspace {0} ^ U < ~R d , rational with respect to x, and any w £ R d , x is X w+ u-almost 
surely grid-DV. 

Proof. Let X w +u be a measure as in the statement. As the orbit H n>m x is nondivergent, 
there exists a diverging sequence h n £ H F , such that h n x converges. In fact, as H nm is 
the product of a compact group with the one parameter group a n>m (t), we may assume 
that h n = a n>m (t n ) for some t n — > ±oo. As the eigenvalues of a n ^ m (t n ) approach or oo, 
it follows that h n is not almost finite with respect to U. Theorem 12.21 applies and we 
conclude that x is A^+^-almost surely grid-DV. □ 

In §3.21 we apply Theorem 12.31 to Diophantine approximations and in particular to 
derive new results on nonsingular forms. We also remark here that because H n m is of 
real rank 1, for any lattice x £ Xj, there always exist grids which are not DV; that is, 
there are no grid-DV lattices in this case. This result was proved first by Davenport j9] 
for the case d — 2,n, m — 1, while recently, Einsiedler and Tseng proved (for general d, 
n — d — l,m = 1) [ID] that the set of such grids is of full Hausdorff dimension, and in 
fact, a winning set for Schmidt's game (see also [5] for generalizations). 

2.3. The product of linear forms. Let N : M. d — > K be the function N(x) = Yli x i- 
We denote the invar iance group Hn, by A. It is the group of diagonal matrices with 
positive diagonal entries in G. It is not hard to see that N is nondegenerate and noncom- 
pact. Hence, Theorem 12.21 and Theorem 12.11 apply and we know that any lattice with a 
nondivergent A-orbit is almost surely grid-DV with respect to the Haar measure of the 
full torus. We shall see in the next section that this result has significant implications to- 
wards Minkowki's conjecture. It is worth noting here that a classification of the divergent 
A-orbits, due to Margulis, is given in [21]. It is proved there that Ax is divergent, if and 
only if, there exists a £ A, such that the lattice ax is contained in Q d . There are many 
extra assumptions one can impose on a lattice x to ensure that x is almost surely grid-DV 
with respect to any nontrivial measure of the form \ w +u- The following two corollaries 
are examples of such. The first is proved in § [5] 

Corollary 2.4. Let x £ X^ be a lattice which does not contain any vector lying on the 
hyperplanes of the axes (that is the hyperplanes orthogonal to the standard basis vectors). 
Then for any subspace {0} ^ U < M. d , rational with respect to x, and for any w £ IR d , x 
is X w+ u- almost surely grid-DV. 

Corollary 2.5. Let x £ Xd be a lattice such that there exists a sequence 

a n = diag [e^ , . . . , e** J £ A, 

such that a n x converges, and for any 1 < i < d, tf 1 ^ diverges. Then x is almost surely 
grid-DV with respect to any nontrivial measure X w +u 

Proof. This follows immediately from Theorem 12.21 as under our assumptions, the se- 
quence a n is not almost finite with respect to any nontrivial subspace. □ 
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Thanks to the classification of divergent A-orbits, we have the following theorem which 
shows what happens for lattices with divergent orbits. 

Theorem 2.6. Let x G Xd be a lattice with a divergent A-orbit. Then Vn{v) is discrete 
(and moreover Vjv(y) = Vn(v) ), for any grid y G n^ 1 (x). 

Proof. As the value set does not change along the orbit, we conclude from the classification 
of divergent A-orbits, mentioned above, that we may assume that x C Q d . In fact, it is 
not hard to see that the validity of the theorem is stable under commensurability; that 
is, if x, x' are commensurable lattices (i.e. x D x' is of finite index in both), then the sets 
Vn(z/) are discrete, for any y G 7r _1 (x), if and only if the same is true for any y G 7r _1 (x'). 
This enables us to assume that x = 7L d . So, we are left to verify that for any vector 
v eR d , the set 

V N (v + Z d ) = I f[(ni + v i ):n l eZ 



is discrete. To prove this, let ni( n !^ + v i) be a converging sequence of elements of 
Vn(v + Z d ), and assume by way or contradiction that its elements are distinct (this rules 

(k) 

out the possibility of having n\ + Vi = for more than one k). For each i, the sequence 
(nj + vi), is discrete. Then we are able to take a subsequence so that for each i, along 
the subsequence, either [n^ + vi) is constant and nonzero, or it diverges. As we assume 
the original sequence is converging, we must have that the subsequence is constant from 
some point which contradicts our assumption. □ 

In contrast to the the situation presented in the previous subsection (Theorem 12.31) . 
where nondivergence of the orbit of a lattice under the invariance group was sufficient to 
ensure that the lattice is almost surely grid-DV with respect to any measure of the form 
A w+ c/, we work out the following example which shows the existence of a lattice x G X 3 , 
with a nondivergent A-orbit, and a nontrivial subspace U < M 3 , rational with respect to 
x, such that x is not A^-almost surely grid-DV. In fact, we shall see that the value set 
Vat(x + u) is discrete for any u G U 

Example 2.7. In this example, as we will mix the dimensions, we denote our function 
N and its invariance group A in dimension d by Nd and Ad respectively. Let go G SL2QR) 
be such that the lattice (?o has a compact orbit in X 2 , under the action of the group A 2 = 
{diag (e 4 , e _i )}. It is well known that this is equivalent to saying that the value set Vat 2 (^o) 

is discrete and does not contain zero. Let g = I I ° j G SL 3 (M) (here the zeros stand 



. g 

for the corresponding row and column zero vector in M 2 ), and let U = {(*,0,0) 4 G M 3 }. 
Then it is clear that for any grid y = g + u, u G U, the value set Vjsr 3 (y), is discrete. 
Indeed, if u = (a, 0, 0)*, then 

V N3 (g + u) = {(n + a)N 2 (w) : n G Z, w G g } , 

which is clearly discrete as VKr 2 (po) is- 
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To see how this fits with Theorem 12.2} note that if 

a n = diag(e 1 , e 2 ,e (l + 2 >) 

is a sequence in A3, satisfying that a n g converges in X3, then it is easy to see that ti 
must converge, and so the sequence a n is almost finite with respect to U . 

We remark here that for d > 3, the fact that the invariance group is of higher rank, 
enabled the author of the present paper to prove [20] the existence of grid-DV lattices. In 
this case such lattices are also known as GDP lattices (grid-dense-products). 

2.4. Indefinite quadratic forms. Let p, q > be integers such that d = p + q > 3. 
Let F be the indefinite quadratic form given by F(x) = Yli x f ~ Ylp+i x i- ^ n this case 
Hp = SO(jo, q). It is not hard to see that F is nondegenerate (in the sense of Definition [L4]) 
and noncompact, hence Theorem 12.21 applies. In fact, as Hp is generated by unipotents, 
it follows from [16] that Hp has no divergent orbits and so the theorem applies for any 
lattice. 

The fact that Hp is generated by unipotents allows one to obtain much sharper results 
than Theorem 12.21 In fact, one has the following theorem and corollary which follow from 
Ratner's orbit closure theorem and Lemma [TT2"1 We do not elaborate on the arguments as 
this case is well understood and sharper results than those stated here are available (see 
for example [17]). 

Theorem 2.8. Let y G Yd be given. If the orbit Hpy is closed, then the value set Vp(y) 
is discrete. Otherwise, the value set is dense. 

It is not hard to see that in this case, a grid y 6^ has a closed Hp-orbit, if and only if, 
7r(x) G Xd has a closed orbit, and y is rational] that is to say y = x + v for v G SpanQ^. 
So we have a full understanding: 

Corollary 2.9. Let x G Xd be given. If the orbit, Hpx, is not closed, then x is grid-DV. If 

the orbit, Hpx, is closed, then rational grids have discrete value sets, while irrational grids 
are DV. In any case, x is almost surely grid-DV with respect to any nontrivial measure 

3. Applications 

We now apply some of the above theorems. In §3.1l we apply Theorem 12 .11 to derive new 
information towards Minkowski's conjecture. In §3.21 we apply Theorem 12.31 to generalize 
some results on inhomogeneous approximations and provide a partial answer to a problem 
posed in [2"2"] . 

3.1. Minkowski's conjecture. We shall use the notation of §2.31 The following con- 
jecture is usually attributed to Minkowski and hence named after him. It has remained 
open for over a century. 

Conjecture 3.1 (Minkowski). Let d > 2 and iV : R d R be the function N(x) = l\t x i- 
Then, for any y G Y d one has V N (y) fl [-2~ d , 2~ d ] ^ 0. 
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To this date, Minkowski's conjecture is verified up to dimension 7. For more information 
about the interesting history and recent developments we refer the reader to the recent pa- 
pers [IB] . [12], where it is proved for dimensions 6 and 7. We say that a lattice, x G X d , sat- 
isfies Minkowski's conjecture, if any one of its grids satisfies it. In [3], Bombieri proved that 
for d > 5, for any lattice x G Xd, the set {y G 7r _1 (x) : y satisfies Minkowski's conjecture}, 
has Lebesgue measure > 2~^~. The results of Narzullaev [19], allows one to strengthen 
Bombieri's result, still obtaining a poor lower bound. We strengthen this estimate to the 
best possible from the measure theoretic point of view and prove 

Theorem 3.2. For any d > 2 and any lattice, x G Xd, the set 

[y G 7r _1 (x) : y does not satisfy Minkowski's conjecture} , (3-1) 
is of Lebesgue measure zero. 

Proof. Let x G Xd be given. We divide the argument into cases. If the orbit Ax, of 
the invariance group, is nondivergent, Theorem 12.11 implies that almost any grid y of x 
is DV. In particular, VN{y) H [— 2~ d ,2 ] ^ and the claim follows. If the orbit Ax is 
divergent, then thanks to the characterization of divergent orbits given in [21], we see that 
there exists a G A, such that the lattice ax has a basis consisting of vectors with rational 
coordinates. We refer to such lattices as rational. It is well known, (see [IS]), that rational 
lattices satisfy Minkowski's conjecture, hence the set in ( 13. ip is in fact empty. □ 

We note here that if we impose on the lattice x the further assumptions of Corollar- 
ies I2.4[ I2.5[ then the above theorem can be strengthened to say that the set of possible 
counterexamples to Minkowski's conjecture in the torus ir~ l (x), has measure zero with 
respect to any measure of the form X w+ u. 

3.2. Inhomogeneous approximations and nonsingular forms. In this section we 
apply Theorem 12.31 to deduce some new results about nonsingular forms. We fix some 
positive integers n, m and consider a matrix A G Mat nxrn (M). In Diophantine approxima- 
tions, A is usually regarded as defining a system of n linear forms in m variables, hence 
we shall refer to such a matrix as a form. In first reading it might be useful to take m = 1 
and n arbitrary, hence A G lR n stands for a column vector, or even n = m = 1, and then 
A stands for a real number. 

For any dimension k, let (■) denote the distance from Z fc in R fc , i.e. for v G 

(v) = min — q|| 2 : q G Z fc j . 

Note that for v G M fc , (v) can be thought of as the distance in the torus T k = M fc /Z fc , 
from v to 0. Here and in the sequel we abuse notation freely and denote points on the 
fc-torus the same as their representatives in M fc . 

Definition 3.3. A form A G Mat nxm (M) is singular, if for any 5 > 0, for any large enough 
N G N, one can find q G Z m such that 

< ||q|| 2 < N and (Aq) < — g-. (3.2) 



GRIDS WITH DENSE VALUES 



9 



Following Baker we say that the form A is highly singular, if there exists e > such 
that for all large enough iVeN, one can find q G Z m such that 

||q|| 2 < N and (Aq) < (3.3) 



It is clear that a highly singular form is singular. By applying Theorem 12.31 we shall 
deduce some results about nonsingular forms. We shall prove Theorem 13.71 but for the 
meantime let us formulate a restricted version of it in the form of the following theorem: 

Theorem 3.4. Let A G Mat nxm (lR) be a nonsingular form. Then, for Lebesgue almost 
any x G R n , the following set is dense in [0, oo), 

{\\q\\™(Aq+x) n :qeZ m }. (3.4) 

In particular, the following statement holds 

For Lebesgue almost any x G M. n , liminf ||<7|| 2 n (Aq+ x) = 0. (3-5) 



Remarks 3.5. (1) It follows from the main theorem in [5J, that for any nonsingular 
form A G Mat nxm (M), the following statement holds 

{0 for w < — 
c m (3.6) 

oo for w > — . v ' 



The innovation in Theorem 13 A\ is that the liminf in the critical exponent ^, 
equals zero (and in fact much more, namely, the set in (13 .4p is dense). 
(2) The above theorem generalizes a series of results. When one takes n = m = 1, 
hence A is a real number, Kurzweil proved [2] that for almost any A, (I3.5P holds. 
Not long ago Kim proved [13] that Kurzweil's result holds for any irrational A G R 
(which is the same as being nonsingular in these dimensions). Tseng reproved 
Kim's result [22], and raised the following question, aiming to generalize Kim's 
result to higher dimensions: 

Question (Tseng). For m = 1, hence A G M. n is a column vector, is it true that 
if A does not lie in a rational hyperplane, then (13. 5p holds. 

Galatolo and Peterlongo [UJ constructed a (singular) vector in M. 2 , giving a 
negative answer to Tseng's question. Nevertheless, Theorem 13.41 tells us that the 
answer to Tseng's question is positive for nonsingular vectors (a negative answer 
to Tseng's question could also be presented by a vector A G lR n which does not 
lie in a rational hyperplane such that the transposed form A 1 is highly singular. 
To see this, note that from the main result in [6J, it follows that in this case the 
value of the liminf in (13. 5p equals oo for almost any x). 

We turn now to the statement and proof of Theorem 13.71 This theorem implies The- 
orem 13.41 when applied with respect to Lebesgue measure but is in fact considerably 
stronger. Nevertheless, it is a simple application of Theorem 12.31 In order to state this 
theorem, we need to link the above discussion with the discussion of §2.2[ namely the 
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discussion of values of forms on grids. Let d = n + m and recall the notation of §2.21 
We write vectors in R d as (x,y)*, where x G W 1 and y G JR m . We let P n , m : R d -»■ R be 
the function given by P nm (x, y) = 1 1 ^ ■ ||y||™, and denote the invariance group of it by 
H n>m . Recall also that H n , m = SO n (R) x SO m (M) x {a n . m (t)} teR , where a n , TO (t) is given 
by <|27T|> . For a form A G Mat„ Xm (R) we denote 



where I n , I m denote the identity matrices of the corresponding dimensions and 0, the zero 
matrix in Mat mxn (R). We denote the lattice qa G Xg, spanned by the columns of g^, by 
xa- The following result of Dani is well known (see Theorem 2.14 in [7]), and furnishes the 
link between the nondivergence condition in Theorem 12.31 and the nonsingularity property. 
We include the proof for the completeness of our presentation. 

Lemma 3.6. A form A G Mat„ xm (M) is singular, if and only if, the orbit H n ^ m XA is 
divergent in X^. 

Proof. Assume that the orbit H nym XA is divergent. As a njm (t) is cocompact in H nm , this 
is equivalent to saying that the a„ )m (t)-orbit of xa is divergent. Fix 5 > 0. We need to 
show that for all large enough JV 6 N, one can solve (13. 2p . The divergence of the orbit 
implies, by Mahler's compactness criterion, that there exists T G M, such that for any 
t > T , the lattice a n>m (t)xA contains a nonzero vector of length < 5. I.e. for any t > T , 
there exist p G Z n , q G Z m , not both zero, such that 



Note that it follows that q must be nonzero. The vector composed of the first n coordinates 
of the vector appearing in (13.81) equals e mi (Aq + p), while the one composed of the last 
m coordinate equals e~ nt ||q|| 2 - Both these vectors have length < 5, hence we conclude 
that ||v4q + p|| 2 = (Aq), and that 



Let iV > e be given and choose t > T so that N = e nt . We conclude from the above 
that there exist a nonzero q G Z m satisfying ||q|| 2 < Se nt < N, (Aq) < = —4r, and so 
A is singular as desired. 

We leave the other implication as an exercise to the interested reader, as we shall only 
use in this paper the implication proved above, i.e. that if A is a nonsingular form, then 
H n ^ m %A is nondivergent. □ 

Before stating Theorem 13.71 we introduce some more notation. Let p : M. d — > M. n be the 
projection p(x, y) = x. Abusing the notation introduced in §1.2[ we denote for a subspace 
U < R d and a vector wo G M, d , by \ Wo +u, the natural [/-invariant measure supported on 
the affine subspace w + U. 




(3.7) 




(3.8) 



(Aq) < 5e~ 
< ||q|| 2 < 5e ni 



(3.9) 
(3.10) 
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Theorem 3.7. For any nonsingular form A G Mat nxm (IR) ; and for any measure of the 
form [i = p*(\ wo +u), where U is a nontrivial subspace ofR d , rational with respect to the 
lattice xa , and w is arbitrary, for fi- almost any x G W 1 , 

the set {|| q\\™ (Aq+ x) n : gG Z m } is dense in [0,oo). (3.11) 

Proof. Theorem 12.31 together with Lemma [3.61 imply that xa is A to0 +j/-a.s. grid-DV. This 
means that for A^^-almost any w = (x, y)* G M d , the grid xa + w is DV. Calculating 
the value set we get 

Vp n Jx A + w) = {||q + y\\™ \\Aq + x + pf 2 : p G Z", q G Z m } . (3.12) 

Given w = (x, y)' such that xa + w is DV, for any 7 G [0, 00), there are appropriate 
sequences Pj, q, satisfying limj ||q i + y|| m \\Aq { + x + pj| n = 7 (and the convergence is not 
trivial). Then ||qj|| 2 must go to infinity, which in turn implies that ||Aq^ + x + pj 2 — > 
and in particular, || ^4q i + x + p ?; || 2 = (Aq t + x). It follows that linij ||qj|| m (Aq { + x) n = 7 
as well. We conclude that 

p- 1 ({x G R n : fl3"TT]) dose not hold}) C {w = (x, y)' G R d : the grid x A + w is not DV} 

As the right hand side is A„, 0+ c/-null, the left hand side is too and by definition, for 
P*(A Wo +c/)-almost any x, (13. lip holds. □ 

The above theorem, when applied with U = M. d , gives Theorem I3.4[ as mentioned 
earlier. Let us demonstrate the strength of Theorem 13.71 with the following 

Corollary 3.8. Take m = 1 and n arbitrary. For any nonsingular vector v G M™ and for 
Lebesgue almost any t G R the following set 

{\q\ ((q + t)v) n : q e Z} , 

is dense in [0, 00). 

Proof. Apply Theorem 13.71 with wq = and U being the one dimensional subspace 



V = { t[ v x ) G R n+1 : / e 



□ 



4. The Mixing and the Coset lemmas 

In this section we study the following question. Given a sequence of endomorphisms 
of the d-torus, 7 n : T d — > T , and a Haar measure, Xjj, of a subtorus (see the notation of 
§1.21) . what can one say about the closure {^ n v ■ n G N}, for A^-almost any point v G T d . 
We shall see in the Coset lemma below, that unless an obvious obstacle is present, this 
closure must contain a coset of a nontrivial subtorus. In the course of proving the Coset 
lemma, which is the goal of this section, we shall prove the Mixing lemma, stated below, 
which is of independent interest on its own. Both the Coset and the Mixing lemmas are 
stated in somewhat greater generality than we actually need in practice. This is dictated 
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by the argument we use to derive the Coset lemma, which is inductive, hence we need to 
let the dimensions of the tori to be arbitrary. 

Given a compact metric space X, the space of Borel probability measures on it, V(X), 
is compact with respect to the weak* topology. When considering measures, we shall only 
refer to the weak* topology. If 7 : X — > Y is a measurable map between compact metric 
spaces, it defines a map 7* : V(X) — > V(Y), given by ■y*(r])(A) = rjij^^A)), for any Borel 
measurable A C Y and 77 G V(X). The following definition is new as far as we know. 

Definition 4.1. Given two probability measures rj G V{X\v G V(Y), and a sequence 
of measurable maps 7„ : X — > Y, we say that 7„ mixes f] to u, if for any absolutely 
continuous probability measure rf <C r/, the sequence ('Jn)*v' ~~ v we& k* (we sometime 
say that (7«)*?7 converges mixingly to v). 

Note that the above definition is equivalent to the requirement that for any g G L l (rj), 
and any continuous / G C{Y) one has 



To explain the terminology, recall that given v G V(X) and a sequence of measurable maps 
7„ : X — )■ X, preserving v, the sequence 7 n is said to be a mixing sequence with respect 
z/, if for any two measurable sets A,B C X, one has z/(7„(A) PI B) — >■ z/(A)i/(i?). This 
is easily seen to be equivalent to the fact that 7„ mixes v to itself using the terminology 
introduced above. 

We note here that the fact that 7™ mixes rj to v is much stronger than the fact that 
(jn)*V v - m particular, one consequence that we will be interested in is the following 
simple lemma. 

Lemma 4.2. If 7„ : X — >■ Y mixes r\ to v , then for rj-almost any x G X, the closure 
{•y n x : n <EN} contains supp(z/). 

Proof. Assume that the conclusion is false. Then, there must exist a bump function 
/ G C(Y) (i.e. < / < 1), with J Y fdv > 0, and a measurable set C X, with r/(fi) > 0, 
such that for any x G Q, the sequence 7„x never visits U = {y EY : f(y) > 0}. Let 
g G L 1 ^) be the characteristic function of Q. We now see that the left hand side of (14.11) 
is constantly zero, while the right hand side is positive. A contradiction. □ 

Lemma 4.3 (Mixing lemma). Let U < lR dl be a subspace, rational with respect to Z dl . 
Let 7„ : T dl — > T d ' 2 be a sequence of homomorphisms induced by the matrices 7 n G 
Mat^xda^). Assume the sequence ( r y n )*^u converges weak* to n G ^(T^ 2 ). Then, there 
exists a rational subspace V < lR d2 , such that \i = \y, and furthermore, the sequence 7„ 
mixes Xu to Xy, if and only if, there are no integer vectors m G Z d2 and 7^ k G U , 
satisfying 7^771+ k G If- 1 , for infinitely many n's. 

Before turning to the proof we make some clarifying remarks and work out some ex- 
amples. First, the fact that fi is a Haar measure is not the essence of the lemma. It is 
an easy exercise to prove that any weak* limit of Haar measures is again a Haar measure 




(4.1) 
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in this context (this fact is implicitly used in the proof below). The true content of the 
lemma is the fact that in the absence of the obvious obstacle to mixing - the existence of 
m, k 7^ such that 7* m + k <G U x infinitely often - 7„ actually mixes Xu to Xy, which 
is significantly stronger than the convergence of (7 n )*Aj/ to Ay. Second, note that given 
any sequence 7 n : T dl — > T d2 , of homomorphisms, and a Haar measure Xu of a subtorus 
U + Z dl C T dl , one can always assume, after passing to a subsequence, that (7 n )*A;y con- 
verges by the compactness of V(T d2 ). Third, note that one could rephrase the condition 
ensuring mixing, as saying that for any m G Z* 2 , either 7^m is eventually in If- 1 , or the 
distance from 7^m to U 1 - goes to 00. Finally we note that when di = d 2 = 2, U = Mr 1 , 
and the 7„'s are automorphisms, the lemma is proved in [2] Lemma 2.2. 

Example 4.4. Let 7 G GL^(Z) be an automorphism of the ci-torus which has an ir- 
reducible characteristic polynomial all of whose roots are real. Then for any subtorus 
U + Z d , the sequence j™Xu converges mixingly to the Haar measure of the full <i-torus. 
In particular, for A^ almost any point v G T d , the orbit {7™f }, is dense in T d . 

We split the argument into two parts. In the first we show that if a subsequence 7™ 4 A;y 
converges to some Ay, then V = M d , which proves that 7 n A[/ converges to X^d. In the 
second we show that the convergence is mixing. We rely on the following two properties 
of 7 (resp. 7'): 

(1) 7 (resp. 7*) is diagonalizable over R with all eigenvalues distinct and of absolute 
value 7^ 1. Hence, for any v G M d , 7 n t> converges projectively to a one dimensional 
eigenspace of 7. 

(2) Any 7-invariant (resp. 7*-invariant) subspace (which is necessarily a direct sum of 
eigenspaces) , is not a subspace of any proper rational subspace (this follows from 
the irreducibility of the characteristic polynomial). 



First step: If rii is such that 7**Aj/ converges to some Xy, then in particular, for any 
v G U, any projective limit of 7 ni t> , is a line in V. But, from (1) it follows that such a 
line is an eigenspace of 7 and property (2) implies that V, which is a rational subspace, 
must equal M. d . 

Second step: We need to show that there could not exist integer vectors m G 7^ 
k G U, such that (7 i ) ra m + k G U for infinitely many n's. From property (1) we deduce 
that (7 i ) n m converges projectively to a 1-dimensional eigenspace of 7* which is contained 
in U (as k is fixed and (7*) n m must diverge), which is a rational proper subspace (as 
U 7^ {0}). This contradicts property (2). 

Example 4.5. Let 7«=^^ J V If C/ = {( x , x ) G 1R 2 : x G M}, then although (j n )*Xu 

converges to the Haar measure of the full 2-torus, 7„ does not mix A^ to Ar2. This is 
because, if we choose m = (1,0)*, then m is a fixed point of 7* , and is not in U^, so we 
see that 7^m is not eventually in U x and its distance from U 1 - does not diverge. Indeed, 
Lemma [O] does not apply and for A^-almost any v G T 2 , the closure {jnv} equals a coset 
of a lower dimensional subtorus. 
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Proof of the Mixing lemma. For any d and m G Z d , let e m (v) = e 27Tt<m > v > : T d — > C. 
Note that in our notation there is no reference to the dimension and the reader should 
understand from the context what is the domain of the character e m . In particular, there 
will be times in which in one equation, two dimensions will be mixed. We need to find 
a rational subspace V <M. d2 , such that (JUL]) holds for any / G C(T d2 ),g G L^Xu). A 
standard argument shows that it is enough to verify the validity of (14. ip when / and g 
are chosen from sets spanning dense subspaces of C(T d2 ) and L l (Xu) respectively. From 
the Stone- Weierstrass theorem it follows that { e m : m G 7L d ' 2 } spans a dense subspace of 
C(T d2 ). Also, the set {ek : k G U} spans a dense subspace in L l {\u). This is because it 
forms an orthonormal basis to L 2 (Xu), which is dense in L 1 (A[/) by the Cauchy Schwarz 
inequality. 

We conclude that we need to find a rational subspace V < M. d2 , such that the following 
convergence holds 

For any meZ^kef/, / e llm+lt (v)dX u (v) (4.2) 



T, 

T d i 



f e 2TTi< 7 im+Kv> dXu ^ = f 
JJ d i JT d i 



e m (j n v)e k (v)dX u (v) (4.3) 
-> / e m (v)dX v (v) I e k (v)dXu(v). (4.4) 

JT d 2 Jf d l 

Given a rational subspace V < M. d and a corresponding Haar measure Ay, a short calcu- 
lation shows that for any m G Z d we have 

w*m.)={j iZZC: < 4 - 5 > 

Working with equation (JUS]), we conclude that for any choice of V, the values in (14.21) . (14.41) 
satisfy 

_ = f 1 if 7 * m + k G f/\ . „ = f 1 if m G V ± and k = 0, 
^^"^ 1 otherwise ' ^—"—^ \ otherwise 

We conclude that given Ay, the sequence 7 n mixes Xu to Ay if and only if the values 
of (14 .2p and (14.41) agree for all large enough n's. This implies immediately the only if part 
of the lemma; if there exist m G Z d2 ,0 / k G U, such that 7^m + k G U L for infinitely 
many n's, then for these values of n, the value of (14. 2p is 1, while the value of (I4.4p is 0. 

We are left to prove the if part. Assume then that for any m G Z d2 and ^ k G U, 
7^m + k is eventually outside U^. We shall conclude the proof by showing that for any 
subsequence of j n , there exists yet another subsequence %, and a subspace V, such that 
7 n mixes A^ to Ay. To see why this concludes the proof, note first that this implies that 
ji = Ay, as we assume that (■y n )*Xu converges to /x, hence any subsequence of it converges 
to jj, too. In particular, this shows that V does not depend on the initial subsequence. 
Second, if 7 n does not mix Xu to Ay, then the above discussion shows that there must 
exist integer vectors m G Z d and k G U, such that for infinitely many n's, the value 
in (14.21) is different from the value in (14.41) ; we obtain a contradiction once we take as the 
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initial subsequence, those 7 n 's for which the values in ( 14. 2 p and ( 14. 4 p do not agree, seeing 
that there could not be a subsequence % of that subsequence, which mixes \u to Ay. 

To this end, let a subsequence of 7„ be given. By a standard diagonal argument, it has 
a subsequence 7„ such that for any m G Z d2 , either 7*m G U ± for all but finitely many 
n's, or 7^m ^ U for all but finitely many n's. Define 

V L = span {m G Z d2 : 7*m G t/" 1 for all large n's} (4.6) 

= span jm G 7L d2 : 7^m G t/ 1 " for infinitely many n's} . 

We remark that the above set of integer vectors spanning V 1 - is clearly a group and in 
fact it is the intersection of V 1 - with Z d2 ; that is, if m G V ± is an integer vector, then 
7* m G U for all large enough n's. 

We now check that along the subsequence j n , the values of (14. 2p . (14. 4p agree for all 
large enough n's, hence concluding that 7 n mixes \u to Ay as desired. There are two 
cases in which the values in (14. 2p . (I4.4p do not agree (with 7 n replaced by j n ). 

(1) Either 7*m + k ^ U L and k = and m G V x , 

(2) or 7* m + k G U 1 - but either k / or m ^ V- 1 . 

It follows from the remark made after the definition of V ± , that ([1]) cannot happen 
infinitely many times. Assume (T5]) holds for infinitely many n's. It cannot be that k ^ 
because of our assumption that 7^m + k is eventually outside U- 1 , for any m and nonzero 
k. On the other hand, it cannot be that k = 0, by our construction of 7„ and the definition 
of V- 1 , as it would mean that eventually 7*m G U , hence m G V^. In any case we arrive 
to a contradiction as desired. □ 

We now use the Mixing lemma to prove another lemma which serves as the main 
tool used to prove Theorem 12.21 As in the case of the mixing lemma, this lemma is of 
independent interest on its own. 

Lemma 4.6 (Coset lemma). Let U < M. dl be a subspace, rational with respect to 1* dl . 
Let 7 n : T dl — > T d2 be a sequence of homomorphisms induced by the matrices 7„ G 
Matd lX d 2 (Z), such that the restrictions 7 n |[/ form an infinite set. Then there exists a 
rational subspace {0} ^ V < M. d2 , such that for Xu-almost any v G T dl , the closure 
{7„f : iGN} contains a coset of the subtorus V + Z d2 

Note that in the above lemma, the subtorus V + 7L d2 is fixed, while its coset depend on 
the initial point. 

Proof. Along the argument we will take subsequences a large number of times. We will 
abuse notation and continue to denote the subsequences by the same symbols. The proof 
goes by induction on d 2 but we first make some general observations. First, by taking 
a subsequence, we may assume that the restrictions j n \u are distinct and that (7„)*A;y 
converges. If there are no integer vectors m G Z d2 , ^ k G U, such that 7^m + k G U 1 - 
for infinitely many n's, then the Mixing lemma implies that 7 n mixes \u to some Ay. Of 
course V ^ {0}, as equality would imply that U = {0} which contradicts our assumption 
that the restrictions j n \u are distinct. Lemma [4721 now tells us that for A^-almost any 
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v G T dl , the closure {^ n v } actually contains the subtorus V + Z* 2 and the lemma follows 
with the additional information that the coset we gain in the closure does not depend on 
the initial point. Assume now that there exits integer vectors m G Z d2 ,0 ^ k 6 [/, such 
that 7* m + k G U 1 - for infinitely many n's. By passing to a subsequence we may assume 
this happens for all n. Note that the validity of the lemma is stable under a change of 
variable, that is if A G SL ( i 1 (Z),5 G SLd 2 (Z) are two automorphisms of T di , then the 
lemma is true for U, 7„, if and only if it is true for A~ X U, B^ n A. It follows that we might 
assume that U is the standard embedding of W in R dl for some 1 < r < d\ (i.e. U is 
the subspace corresponding to the first r coordinates), and that k G U is colinear to ei, 
i.e. there exists an integer I such that k = — le\ = {—£, 0, . . . , 0)' G 7L dx . This is done by 
choosing A properly. By choosing B properly, we see that we may assume that m = e\. 
Indeed, m was chosen to be an integer vector satisfying 7^m + k G U 1 . Any choice of B 
results in replacing the original m by (i?') _1 m. Choosing B properly, we can guarantee 
that m is an integer multiple of e±, but as it can be chosen to be primitive, we may assume 
it actually equals e\. To summarize, if we assume that 7„ does not mix \u to some Ay, 
we may assume we are in the following position: 

(1) The space U equals the standard copy of W in IR dl . 

(2) The matrices 7„ satisfy 7*ei — ie\ G U 1 . This means that the first row of the 
matrix 7„ is of the form [l, 0, . . . , 0, * • • • *). 



The particular shape of the first row of the 7„'s is what enables us to have a reduction 
to a lower dimension and invoke the inductive hypothesis. We now describe the proof by 
induction. 

The case <i 2 = 1: We prove this case by induction on d\. Note that in this case there is 
just one possibility for V, and only one coset; this means that we need to prove that for 
A[/-almost any v G T dl , the sequence r y n v, is dense in T 1 . For d\ — 1, 7„ are simply integer 
numbers and U — R. The fact that we assume that 7„ are all distinct imply that 7„ always 
mixes to itself and we are done as explained above. Assume we know the lemma for 
d < di and d 2 = 1. If 7 n mixes \u then we are done as explained above. If not, then we 
may assume after a change of variable, as explained above, that the first coordinate of 
the matrix 7 n (which is a row vector), is fixed and equals to some integer £. In particular, 
this forces the dimension of U to be > 2, as we assume that the restrictions 7„|(/ are all 
distinct. The lemma now clearly follows from the inductive hypothesis, as given a vector 
v = (vi, . . . , v r , . . . 0)* G U, we know that for almost any v' = (i>2, . . . , v r , . . . 0)*, the 
sequence j^v' is dense in T 1 (here 7^ are the 7„'s after we erase the first coordinate). But 
then we see that for A[/-almost any v G U, the sequence 7„t> = lv\ + ^' n v' is dense in T 1 . 
This finishes the case di — 1 which is the base of our induction. 

The inductive step: Assume we know the lemma when the dimension of the image 
torus is less than d 2 - If 7„ mixes A^ then we are done as explained above. If not, then 
after a change of variable we may assume that the 7„'s are of the form 



r 




(4.7) 
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where p n is a column vector, s n = (0, . . . , 0, * ■ ■ ■ *), and q n are matrices of the appropriate 

r-l 

dimension. Let us denote by 7^ the matrices obtained from 7„ by erasing the first row. 
We view them as homomorphisms from T dl to T d2_1 . The restrictions of 7^ to U (which 
is the copy of IR' r in M. dl ) must all be distinct, as we know this for the 7 n 's, and their first 
row is identical, as far as U is concerned. From the inductive hypothesis we conclude that 
there exists a nontrivial subtorus V + Z d ' 2 ~ 1 in T d2_1 , such that for A^-almost any v G T dl , 
the closure {*y' n v} contains a coset, w' v + (V + Z d2 ~ 1 ), where w' v G R* 2-1 . We conclude that 
for such v 's, the closure {j n v} contains the coset w v + (V + Z d ' 2 ), where w v is obtained 
from w' v by adding to it as a first coordinate lv\. This concludes the proof. □ 

We work out a few examples to develop a feeling about the null set of points which 
corresponds to closures which do not contain a nontrivial coset. A first example is given 
in Example I4.5[ where the points v = (x,xY G U for which the closure of the sequence 
j n v = (x, (n + l)x), in T 2 , does not contain a coset of the subtorus corresponding to the 
second coordinate, are exactly those with x G Q, which clearly form a A[/-null set. As a 
slightly more complicated example we have 

Example 4.7. Let Let 7„ = ( ^ ^ji an d U = IR 2 . Then for v = (x,yY, the closure 

of the sequence 7„t> = (x,n(x + y)) equals a coset of the subtorus corresponding to the 
second coordinate, if and only if v does not lie on a line of the form {v : x + y = q] for 
some q G Q. This countable collection of lines is clearly a A[/-null set but still a dense set 
of dimension 1. 

Example 4.8. Let M be a diagonalizable epimorphism of the ci-torus. When we apply 
the Coset lemma to the case where 7 n = M n , it follows that for Lebesgue almost any v, the 
sequence M n v contains in its closure a full coset of some nontrivial subtorus. Recently it 
was proved in [5] (see also jl]), that for any given w, the set {t> G T d : inf n d(M n v, w) > 0} 
is dense and of full dimension. In fact, they show it is a winning set for Schmidt's game 
(this generalizes Dani's work [8]). In particular, the set of points v G T d which violate 
the conclusion of the Coset lemma, although is null, is still large from other perspectives. 

5. Proofs of Theorem 12.21 and Corollary 12.41 

We begin by rephrasing the Coset lemma, when we let the lattice vary. We shall use 
the following notation: For a grid y — x + w GYj and v G M. d , we denote by y + v, the 
grid x + (w + v). Given a subspace V < M. d , we let y + V = {y + v : v G V} C 7r~ 1 (x). 
Thus y + V is simply the coset of V passing through the grid y, in the torus 7r _1 (x). 

Lemma 5.1. Let X\,X2 G Xd be two lattices and h n G G be such that h n x\ — > x-i- Let 
U <M, d be a rational subspace with respect to x\ and w G ffi rf be given. Assume that h n is 
not almost finite with respect to U (in the sense of Definition 11.5] ). Then, there exists a 
subspace {0} 7^ V < R d , rational with respect to x 2 , such that for X w+u -almost any grid 
y G 7r _1 (xi) ; the closure {h n y : n G N} ; contains a coset z y + V , for some z y G it~ 1 (x2). 
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Proof. We first note that after passing to a subsequence, we may assume that h n (w + xi) 
converges to some grid in 7r _1 (a;2). This clearly reduces the lemma to the case where we 
are dealing with a Haar measure of a subtorus Xu, rather than a translate of it X w+ u- 

Let gi G G be such that x« = g^T for i = 1,2. There exists a sequence e n G G, with 
e n — > e such that e n h n x\ = X2, or in other words, 7„ = g2 l e n h n gi G T. Let [/' = g^U. 
Then [/' is rational with respect to Z d . The fact that h n is not almost finite with respect 
to U, translates to the fact that the restrictions ^f n \u' form an infinite set. Hence, the 
Coset lemma applies and gives us the existence of a subspace {0} ^ V < lR d , rational 
with respect to Z d , such that for A;y-almost any grid y' of Z d , the closure of the sequence 
j n y' contains a coset of the subtorus V' + Z d . This translates to the fact that for V = g 2 V' ', 
and Af/-almost any grid y G 7r _1 (xi), the closure of the sequence e n h n y, contains a coset 
of the subtorus V + x 2 . This implies the same statement for the sequence h n y, as e n —> e, 
and the lemma follows. □ 

Before turning to the proof of Theorem 12.21 we note that in the terminology introduced 
at the beginning of this section, we have that a continuous function F : ~R d — > M. is 
nondegenerate in the sense of Definition II. 44 if and only if, for any grid y G and any 
subspace {0} ^ V < R d one has U y , &{y+v] V F {y') = F(R d ). 

Proof of Theorem \2.B. Let F, x, U, w, and h n , be as in the statement. Denote x = X\ 
and \\mh n Xi = x 2 - Lemma [5.11 implies that there exists a nontrivial subspace V < M. d , 
such that for A w+ {/-almost any y G 7r _1 (xi), the closure Hpy, contains a full coset z y + V, 
for z y G 7r _1 (x2). The Inheritance lemma (Lemma 11.21) implies now that such a grid 
satisfies 

V F (y) D {J y , e{Zy+v} V F (y') = F(R d ), 
which concludes the proof. □ 



Proof of Corollary 2.4 ■ Let x, U, and w, be as in the statement. From the classification 
of divergent Hp-OTbits (see [21] and $2]), we see that as x does not contain any vectors on 
the axes, it has a nondivergent ifp-orbit. Let h n G H F be a diverging sequence such that 
h n x converges. We only need to argue why the sequence h n cannot be almost finite with 
respect to U . Indeed, assume that h n is almost finite with respect to U . After passing to 
a subsequence and possibly permuting the coordinates (which we ignore in order to ease 

our notation), we can assume that h n = diag ^e'i e'^ j , where for some 1 < r < d 

we have that tf 1 ' diverge for % < r and converge for i > r. It follows that as h n is almost 

finite with respect to U, we must have U < ^ (0, . . . , 0, * • • • *)* G lR d 1 . This leads to a 

contradiction, as U is rational with respect to x, and we assumed that x does not contain 
any point on the hyperplanes of the axes. □ 
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